Optimal measurements for relative quantum information 



Stephen D. Bartlett/'Q Terry Rudolph, 2 -0 and Robert W. Spekkens 3 ^ 

1 School of Physical Sciences, The University of Queensland, Queensland 4 072, Australia 
^Optics Section, Blackett Laboratory, Imperial College London, London SW7 2BZ, United Kingdom 
Perimeter Institute for Theoretical Physics, Waterloo, Ontario N2J 2W9, Canada 

(Dated: 6 July 2004) 

We provide optimal measurement schemes for estimating relative parameters of the quantum 
state of a pair of spin systems. We prove that the optimal measurements are joint measurements 
on the pair of systems, meaning that they cannot be achieved by local operations and classical 
communication. We also demonstrate that in the limit where one of the spins becomes macroscopic, 
our results reproduce those that are obtained by treating that spin as a classical reference direction. 
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I. INTRODUCTION 

Whenever a system can be decomposed into parts, a 
distinction can be made between collective and relative 
degrees of freedom. Collective degrees of freedom de- 
scribe the system's relation to something external to it, 
while the relative ones describe the relations between its 
parts. Encoding information into collective degrees of 
freedom is problematic in situations where the parts of 
the system are subject to an environmental interaction 
that does not distinguish them (collective decoherence), 
or if the external reference frame (RF) with respect to 
which they were prepared is unknown, or if a superselec- 
tion rule applies to the total system [U-013 ^ n contrast, 
encoding information preferentially into the relative de- 
grees of freedom has been shown to offer advantages in 
these situations, with applications in quantum computa- 
tion 0, , communication [|| Q and cryptography 0, |j| . 

If the relative encoding is not perfect or is itself subject 
to some noise, it becomes important to identify measure- 
ment schemes for estimating relative parameters. Such 
measurements have been discussed recently in connec- 
tion with their ability to induce a relation between quan- 
tum systems that had no relation prior to the measure- 
ment, e.g. , inducing a relative phase between two Fock 
states pji El or a relative position between two mo- 
mentum eigenstates fl2^ . Also, measurements of relative 
parameters are critical for achieving programmable quan- 
tum measurements 0, El • 

Schemes for estimating relative quantum information 
are also interesting in their own right. They include 
such natural tasks as estimating the distance between 
two massive particles, the phase between two modes of 
an electromagnetic field, or the angle between a pair of 
spins. In this paper, we focus on this last example: opti- 
mal relative parameter estimation for a rotational degree 
of freedom given a pair of spin systems. Note that our 



problem is complementary to that of determining the op- 
timal measurement schemes for estimating collective pa- 
rameters for a rotational degree of freedom, a subject of 
many recent investigations flol I Kit Il7j| . 

One scheme for estimating such relative parameters is 
to measure each system independently with respect to 
an external RF, e.g., to perform an optimal estimation 
of each spin direction and then to calculate the angle 
between these estimates. We prove that any such local 
scheme, performed using only local operations and classi- 
cal communication (LOCC), cannot be optimal; the abil- 
ity to perform Joint measurements is necessary to achieve 
the optimum We also prove that the optimal mea- 
surement for estimating a relative angle can be chosen to 
be rotationally- invariant, demonstrating that an external 
RF is not required. We investigate the information gain 
that can be achieved as different aspects of the estima- 
tion task are varied, such as the prior over the relative 
angle or the magnitude of the spins. 

Previous studies into parameter estimation have not 
considered the role of the RF (implicitly presumed to be 
classical) ; our development of relative parameter estima- 
tion is appropriate for the case where the RF is itself 
quantized. We investigate quantum-classical correspon- 
dence of RFs by considering the limit in which one of the 
spins becomes large, and demonstrate that our optimal 
relative measurement yields the same information gain in 
this limit as does the optimal measurement for estimating 
a spin's direction relative to a classical RF. Interestingly, 
we also find that the need for joint measurements disap- 
pears in this limit. These results contribute to our under- 
standing of how collective degrees of freedom, which are 
defined with respect to a classical RF, can be treated as 
relative ones between quantized systems. Such an under- 
standing is likely to be critical for quantum gravity and 
cosmology, wherein all degrees of freedom are expected 
to be relative [l9| . 

II. RELATIVE PARAMETER ESTIMATION 
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Consider states in the joint Hilbert space Mj 1 (&Wj 2 of a 
spin-ji and a spin-j2 system. This Hilbert space carries a 
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collective tensor representation R(Cl) = Rj 1 (fi) ® i? j2 (it) 
of a rotation SI G SU(2) where each system is rotated 
by the same amount. We can parametrise the states in 
Wj 1 (£> Mj 2 by two sets of parameters, a and 51, such that 
a state p a ,n transforms under a collective rotation as 

R(n')p a s,R(Q'^ = Pa ^n ■ (1) 

Defining a collective parameter as one whose variation 
corresponds to a collective rotation of the state, and a 
relative parameter as one that is invariant under such a 
rotation, we see that a is relative and ft is collective. 

Note that a typical parameter will be neither collec- 
tive nor relative. However, in situations where a super- 
selection rule for the group of collective transformations 
applies, or when all systems that can serve as a classical 
RF for the collective degrees of freedom have been quan- 
tized, one finds that all collective parameters become op- 
erationally meaningless, and all observable parameters 
are relative. 

In the special case of two spins each prepared in an 
SU(2) coherent state [2(j (discussed below), there is only 
a single relative parameter: the angle between the two 
spins. Note that this angle cannot be perfectly deter- 
mined by a single measurement (there exist sets of states 
with different values of this relative parameter that are 
nonorthogonal) and thus we refer to information about 
this relative parameter as quantum. 

Suppose that Alice prepares a pair of spins in the state 
Pa.n and Bob wishes to acquire information about the 
relative parameters a without having any prior knowl- 
edge of the collective parameters ft. The most general 
measurement that can be performed by Bob is a positive 
operator valued measure (POVM) |2J represented by a 
set of operators {^a}- Upon obtaining the outcome A, 
Bob uses Bayes' theorem to update his knowledge about 
a, f2 from his prior distribution p(a, to his posterior 
distribution: 



p(a,Q\X) 



Tr(_E A Pa,a)p(a, 



where 



Tr(Exp a ,n)p(a, ft) da dfl . 



(2) 



(3) 



Assuming that Bob has no prior knowledge of fi, we may 
take p(a, ft) da dfl = p(a) da dfl where p(a) is Bob's 
prior probability density over a and dfl is the SU(2) in- 
variant measure. 

Any measure of Bob's information gain about a can 
depend only on the prior and the posterior distributions 
over a for every A. The latter are obtained by marginal- 
ization of the p(a, fl\X), and are given by 



p(a\X) 



Tr(E\p a )p(a) 
P(A) 



where 



j a = J R{fl')p atn R(n')hm' . 



(4) 



(•5) 



For a given POVM {E x }, note that any other POVM re- 
lated by a collective rotation (i.e., E' x — R(fl)E\R(fl)^) 
yields precisely the same posterior distributions over a. 
This property also holds true for the POVM with ele- 
ments E\ — J R(fl)E\R(fl) if dfl, which is rotationally- 
invariant, that is, 



R{fl)E x R(fl)^ = E\ , VSle SU(2) 



(6) 



We define POVMs that yield the same posterior distri- 
bution over a to be informationally equivalent. For ev- 
ery POVM, there exists a rotationally-invariant POVM 
of the form (jSJ that is informationally equivalent, and 
thus it is sufficient to consider only rotationally-invariant 
POVMs in optimizing Bob's choice of measurement. 
These can be implemented without an external RF for 
spatial orientation. Moreover, they have a very particu- 
lar form, as we now demonstrate. 

The joint Hilbert space for the two spins decomposes 
into a multiplicity-free direct sum of irreducible repre- 
sentations (irreps) of SU(2), i.e., eigenspaces Mr of total 
angular momentum J. Using Schur's lemma j22j . it can 
be shown that any positive operator satisfying (jSJ can 
be expressed as a positive- weighted sum of projectors Tlj 
onto the subspaces Mj, that is, as E\ = X)j s a,jIIj, 
where s\,j > 0. In order to ensure that E\ = I, wc 
require that Ylx S -M = •"■> so ^ na ^ S *-J ^ s a probability 
distribution over A. The {E x } can be obtained by ran- 
dom sampling of the projective measurement elements 
{rij}, and such a sampling cannot increase the informa- 
tion about the relative parameters (quantified by some 
concave function such as the average information gain). 
Thus, the most informative rotationally-invariant POVM 
is simply the projective measurement {n,/}. 

We have proved the main result of the paper, which 
can be summarized as follows: If the prior over collective 
rotations ft is uniform, then for any prior over the relative 
parameters a, the maximum information gain (by any 
measure) can be achieved using the rotationally-invariant 
projective measurement {nj}. 

A useful way to understand this result is to note that 
our estimation task is equivalent to one wherein Alice 
prepares a state p a (rather than p a .n)- Because the p a 
are rotationally-invariant, they are also positive sums of 
the Tlj and thus may be treated as classical probability 
distributions over J. The problem reduces to a discrim- 
ination among such distributions, for which Bob can do 
no better than to measure the value of J. 

We now apply this result to several important and 
illustrative examples of relative parameter estimation. 
We shall quantify the degree of success in the estima- 
tion by the average decrease in Shannon entropy of the 
distribution over a |2l|. which is equivalent to the aver- 
age (Kullback-Leibler) relative information between the 
posterior and the prior distributions over a, specifically 
Jav = J2\PWh, where 



h = / p(a\X) log 2 [p(a\ X)/p(a)] da . 



(7) 



3 



We refer to this quantity as simply the average informa- 
tion gain. 



A. Two spin-1/2 systems 

The simplest example of relative parameter estima- 
tion arises in the context of a pair of spin-1/2 systems. 
Alice prepares the product state |ni) <g) |ri2), where |n) 
is the eigenstate of J ■ n with positive eigenvalue (note 
that every state of a spin-1/2 system is an SU(2) coher- 
ent state). Bob's task is to estimate the relative angle 
a = cos _1 (n! • n 2 ) given no knowledge of the collective 
orientation of the state. Because the joint Hilbert space 
decomposes into a J = and a J = 1 irrep, the optimal 
POVM has the form {n A ,U s }, where U A = I^X* - is 
the projector onto the antisymmetric (J = 0) subspace 
and lis — I — is the projector onto the symmetric 
(J = 1) subspace. The conditional probability of out- 
comes A and S given a are simply 



p(A\a) = Tr(U APa ) = \ sin 2 (a/2) , 
p(S\a) = 1 — p(A\a) . 



(8) 



The average information gain and the optimal guess for 
the value of a depend on Bob's prior over a. We consider 
two natural choices of prior. 
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FIG. 1: Average information gain for measurements on a 
spin-1/2 system and a spin-j system. The curves (a),(b) cor- 
respond to the optimal measurement and the optimal local 
measurement for the case when the spins are prepared paral- 
lel or antiparallel with equal probability. The curves (c),(d) 
correspond to the optimal measurement and the optimal local 
measurements for the case when the initial direction of each 
spin is chosen uniformly from the sphere. 



1. Parallel versus anti-parallel spins 

This situation corresponds to a prior p(a—0) = 
p(a=Tr) = 1/2, yielding p(A) = 1/4, p(S) = 3/4 and 
posteriors 

p(a=0\A)=0, p(a=ic\A) = 1 , 

p(a=0\S) = 2/3 , p(a=ir\S) = 1/3 . (9) 

Upon obtaining the antisymmetric outcome, Bob knows 
that the spins were anti-parallel, whereas upon obtaining 
the symmetric outcome, they are deemed to be twice as 
likely to have been parallel than anti-parallel. We find 

I A = 1 , Is - | - log 2 3 ~ .08 170 , (10) 

i.e. 1 bit of information is gained upon obtaining the anti- 
symmetric outcome, and 0.08170 bits for the symmetric 
outcome. On average, Bob gains 7 av = \l A + jls — 
0.3113 bits of information. 



2. Uniform prior for each system's spin direction 

In this case, the prior over a is p(a) = ^sina. This 
implies posteriors 

p(a|j4) = sin 2 (a/2) sin a , 

p(a\S) = i(2- sin 2 (a/2)) sin a, (11) 



which are peaked at 27r/3 and 0.4094-7T respectively. 
It follows that these are the best guesses for the an- 
gle a given each possible outcome. Using the posteri- 
ors, we find I A ~ 0.2786, I s ^ 0.02702, which yields 
I av ~ 0.08993. Less information is acquired than in the 
parallel-antiparallel estimation problem, because angles 
near 7r/2 are more difficult to distinguish. 



B. One spin-1/2, one spin-j system 

We now consider the estimation of the angle between 
a spin-1/2 system and a spin-j system for some arbitrary 
j, where the latter is in an SU(2) coherent state |jn) 
(the eigenstate of J • n associated with the maximum 
eigenvalue) [2(j. Alice prepares |ni) <g) |jri2) and Bob 
seeks to estimate a = cos _1 (ni ■ n 2 ). The joint Hilbert 
space decomposes into a sum of a J = j + 1/2 irrep 
and a J = j — 1/2 irrep. The optimal measurement 
is the two outcome POVM {n + ,n_}, where n± is the 
projector onto the j ± 1/2 irrep 1 . Using Clebsch-Gordon 
coefficients, the probabilities for each of the outcomes are 



This measurement is identical to the one described in |l 4l for 
optimal programmable measurements. 
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found to be 



p(-\a) = Tr(n_p Q ) = ^ sin 2 (a/2) , 
p(+\a) = 1 -p(-\a) . 



(12) 



We again consider two possible priors over a. 



1. Parallel versus anti-parallel spins 

A calculation similar to the one for two spin-1/2 sys- 
tems yields the posteriors 



p(a=0\+) = 



2j+l 



2j + 2 
p(a=0\-) = 0, 



p(a=7r\+) = , 

^ V 17 2j + 2 ' 

p(a=*\-) = l. (13) 



Using these, we can calculate the average information 
gain as a function of j; the result is curve (a) of Fig. ^ 
The j — 1/2 value is the average information gain for 
two spin-1/2 systems, derived previously. In the limit 
j — > oo, p(a—0\+) — » 1 and p(a=Ti\+) — > so that the 
outcome of the measurement leaves no uncertainty about 
whether the spins were parallel or antiparallel, and the 
average information gain goes to one bit. Thus, in the 
limit that one of the spins becomes large, the problem 
becomes equivalent to estimating whether the spin-1/2 
is up or down compared to some classical reference di- 
rection, where one expects an average information gain 
of one bit. 



2. Uniform prior for each system's spin direction 

Following the same steps as before, the average infor- 
mation gain can be derived as a function of j; the re- 
sult is curve (c) of Fig. ^ In the limit j — > oo, we find 
7 av = 1 - (21n2) _1 ~ 0.2787 bits, which is precisely the 
information gain for the optimal measurement of the an- 
gle of a spin-1/2 system relative to a classical direction 
given a uniform prior over spin directions pl| . 



C. Optimal local measurements 

Consider again the simplest case of a pair of spin- 
1/2 systems. The optimal measurement in this case was 
found to be the POVM {n^IIs}. This measurement 
cannot be implemented by local operations on the in- 
dividual systems because 11^ is a projector onto an en- 
tangled state. We now determine the optimal local mea- 
surement. We do so by first finding the optimal separable 
POVM (one for which all the elements are separable op- 
erators), and then showing that this can be achieved by 
LOCC. (Not all separable POVMs can be implemented 
using LOCC H^-) The rotationally invariant states for a 
pair of spin-1/2 systems, called Werner states |24|, have 
the form 



and are known to only be separable for p < 1/2 |25| . 
Thus, the greatest relative weight of 11^ to Ilg that can 
occur in a separable positive operator is 3. The clos- 
est separable POVM to the optimal POVM {ILa, n s } is 
therefore {EU + ±n s , |n s }. We note that this POVM 
is informationally equivalent to measuring the spin of 
each system along the same (arbitrary) axis and regis- 
tering whether the outcomes are the same or not, which 
clearly only involves local operations (and does not even 
require classical communication). Because the POVM 
{11^4 + jllg, §IIs} can be obtained by random sampling 
of the outcome of {n^, lis}, the former is strictly less in- 
formative than the latter. Indeed, the maximum average 
information gain with the optimal local measurement is 
0.0817 bits for case (1) above, and 0.02702 bits for case 
(2), both strictly less than those obtained for the optimal 
(joint) measurement. 

We extend this analysis to the spin-1/2, spin-j case. 
Consider the following LOCC measurement. The spin- 
j system is measured along the complete basis of SU(2) 
coherent states {|jn m )} m where m = 0, . . . , 2j and n m 

in some fixed but arbi- 



2j+l 



points at an angle 6„ 

trary plane. Then, conditional on the outcome m of this 
measurement, the spin-1/2 system is measured along the 
basis {|n m ), |— n m }}. The measurement outcome of the 
spin-j system is then discarded, and all that is registered 
is whether the outcome for the spin-1/2 system is ±n m ; 
i.e., whether the two spins are aligned or anti-aligned. 
The resulting 2-outcome measurement is informationally 
equivalent to the rotationally invariant POVM 



III 

n 2 



2j + l 
2j + 2 

n 



n 



+ > 



2j 



-n 4 



(15) 



pw = p&a + (i - p)ns/3 , 



(14) 



By numerically calculating the partial transpose of the 
operator n +aTI + , the negativity of which is a necessary 
condition for non-separability |2g], we find that {ni, 
is the optimal separable POVM. Thus, again, the opti- 
mal separable POVM can be implemented by LOCC and 
gives less information than the optimal (joint) measure- 
ment. The average information gain achieved by this 
measurement, as a function of j, in cases (1) and (2) 
are plotted as curves (b) and (d) of Fig. ^ Note that 
the optimum information gain overall can be achieved 
by LOCC in the limit j — > oo. 



III. DISCUSSION 

We now briefly discuss some other relative parameter 
estimation tasks for which our result provides the solu- 
tion. The case we have yet to address is the estimation of 
the angle between a spin-ji and a spin-j2 system, both 
in SU(2) coherent states, for arbitrary ji, j%. Assum- 
ing J2 > ji, the optimal measurement is the (2ji + 1)- 
element projective measurement which projects onto the 
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subspaces of fixed total angular momentum J. The pos- 
terior distributions over a and the average information 
gain can be calculated as before, although in this case 
they are much more complicated. However, in the limit 
]2 — * oo, the Clebsch-Gordon coefficients simplify, and 
one can show that the probability of a measurement out- 
come J approaches the probabilities obtained using the 
Born rule for a projective measurement along the classi- 
cal direction defined by the spin-j 2 system. Thus, the 
posterior distribution for any measurement result will 
agree with what would be obtained classically, regard- 
less of the prior over a. If, in addition, we take j± — ► oo, 
the information gain for a becomes infinite (for any prior 
distribution) and thus a can be inferred with certainty 
from the measurement result, as expected for a measure- 
ment of the angle between two classical directions. Our 
results also indicate that, in the classical limit, a mea- 
surement of the magnitude of total angular momentum 
should be sufficient to estimate the relative angle, which 
is indeed the case if the magnitude of each spin is known. 

It should be noted that estimating the relative angle 
between a pair of SU(2) coherent states is of particular 
importance because estimating the eccentricity of an el- 
liptic Rydberg state of a Hydrogen atom is an instance 
of the same problem [23] . Rydberg states are significant 
because they can be prepared experimentally. Our re- 
sults imply that an optimal estimation of eccentricity is 
in fact straightforward to achieve experimentally because 
it involves only a measurement of the magnitude of the 
total angular momentum of the atom. 

Our results are also applicable to systems other than 
spin. For example, for any realization of a pair of 2-level 
systems (qubits) , the degree of nonorthogonality between 
their states (measured by, say, the overlap KV'ilV^)!) is 
invariant under collective transformations and is thus a 
relative parameter. Our measurement is thus optimal for 
estimating this nonorthogonality. 

In addition to solving various estimation problems, we 
have shown that a macroscopic spin in the appropriate 
limit is equivalent to a classical external RF as far as 



relative parameter estimation is concerned. This result 
suggests that it may be possible to express all measure- 
ments (and possibly all operations) in a covariant, rela- 
tive framework that respects the underlying symmetries 
of the theory. Such a framework is necessary if one wishes 
to abide by the principle, which has been so fruitful in 
the study of space and time but has yet to be embraced 
in the quantum context, that all degrees of freedom must 
be defined in terms of relations. 

There remain many important questions for future in- 
vestigation. While we have focussed on estimating rel- 
ative parameters of product states, one can also con- 
sider relative parameters of entangled states, and here 
the landscape becomes much richer. For instance, for a 
pair of spin- 1/2 systems, while the set of product states 
supports a single relative parameter, the set of all two- 
qubit states supports three: the angle between the spins 
in a term of the Schmidt decomposition |2l| , the phase 
between the two terms of this decomposition, and the 
degree of entanglement. Our measurement scheme is op- 
timal for estimating these relative parameters as well. 
Given the significance of entanglement for quantum in- 
formation theory, there is likely much to be learned from 
investigating other sorts of relative quantum information. 
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